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GPR 25 POLYNUCLEOTIDES 

FIELD OF THE INVENTION 

This invention relates to newly identified polypeptides 
and polynucleotides encoding such polypeptides, to their use 
in therapy and in identifying compounds which may be 
agonists, antagonists and/or inhibitors which are potentially 
useful in therapy, and to production of such polypeptides and 
polynucleotides. 

BACKGROUND OF THE INVENTION 

The drug discovery process is currently undergoing a 
fundamental revolution as it embraces 'functional 
genomics', that is, high throughput genome- or gene-based 
biology. This approach is rapidly superceding earlier 
approaches based on 'positional cloning*. A phenotype, that 
is a biological function or genetic disease, would be iden- 
tified and this would then be tracked back to the responsible 
gene, based on its genetic map position. 

Functional genomics relies heavily on the various tools of 
bio informatics to identify gene sequences of potential inter- 
est from the many molecular biology databases now avail- 
able. There is a continuing need to identify and characterise 
further genes and their related polypeptides/proteins, as 
targets for drug discovery. 

It is well established that many medically significant 
biological processes are mediated by proteins participating 
in signal transduction pathways that involve G-proteins 
and/or second messengers, e.g., cAMP (Lefkowitz, Nature, 
1991, 351:353-354). Herein these proteins are referred to as 
proteins participating in pathways with G-proteins or PPG 
proteins. Some examples of these proteins include the GPC 
receptors, such as those for adrenergic agents and dopamine 
(Kobilka, B. K., et al., Proc. Natl Acad. Sci., USA, 1987, 
84:46-50; Kobilka, B. K., et al., Science, 1987, 
238:650-656; Bunzow, J. R., et al., Nature, 1988, 
336:783-787), G-proteins themselves, effector proteins, 
e.g., phospholipase C, adenyl cyclase, and 
phosphodiesterase, and actuator proteins, e.g., protein kinase 
A and protein kinase C (Simon, M. I., et al., Science, 1991, 
252:802-$). 

For example, in one form of signal transduction, the effect 
of hormone binding is activation of the enzyme, adenylate 
cyclase, inside the cell. Enzyme activation by hormones is 
dependent on the presence of the nucleotide, GTP. GTP also 
influences hormone binding. A G-protein connects the hor- 
mone receptor to adenylate cyclase. G-protein was shown to 
exchange GTP for bound GDP when activated by a hormone 
receptor. The GTP-carrying form then binds to activated 
adenylate cyclase. Hydrolysis of GTP to GDP, catalyzed by 
the G-protein itself, returns the G-protein to its basal, 
inactive form. Thus, the G-protein serves a dual role, as an 
intermediate that relays the signal from receptor to effector, 
and as a clock that controls the duration of the signal. 

The membrane protein gene superfamily of G-protein 
coupled receptors has been characterized as having seven 
putative transmembrane domains. The domains are believed 
to represent transmembrane a-helices connected by extra- 
cellular or cytoplasmic loops. G-protein coupled receptors 
include a wide range of biologically active receptors, such as 
hormone, viral, growth factor and neuroreceptors. 

G-protein coupled receptors (otherwise known as 7TM 
receptors) have been characterized as including these seven 
conserved hydrophobic stretches of about 20 to 30 amino 
acids, connecting at least eight divergent hydrophilic loops. 



10 



15 



20 



25 



30 



35 



40 



50 



55 



60 



65 



The G-protein family of coupled receptors includes dopam- 
ine receptors which bind to neuroleptic drugs used for 
treating psychotic and neurological disorders. Other 
examples of members of this family include, but are not 
limited to, calcitonin, adrenergic, endothelin, cAMP, 
adenosine, muscarinic, acetylcholine, serotonin, histamine, 
thrombin, kinin, follicle stimulating hormone, opsins, endot- 
helial differentiation gene-1, rhbdopsins, odorant, and 
cytomegalovirus receptors. Most G-protein coupled recep- 
tors have single conserved cysteine residues in each of the 
first two extracellular loops which form disulfide bonds that 
are believed to stabilize functional protein structure. The 7 
transmembrane regions are designated as TM1, TM2, TM3, 
TM4, TM5, TM6, and TM7. TM3 has been implicated in 
signal transduction. 

Phosphorylation and lipidation (palmitylation or 
farnesylation) of cysteine residues can influence signal 
transduction of some G-protein coupled receptors. Most 
G-protein coupled receptors contain potential phosphoryla- 
tion sites within the third cytoplasmic loop and/or the 
carboxy terminus. For several G-protein coupled receptors, 
such as the p-adrenoreceptor, phosphorylation by protein 
kinase A and/or specific receptor kinases mediates receptor 
desensitization. 

For some receptors, the ligand binding sites of G-protein 
coupled receptors are believed to comprise hydrophilic 
sockets formed by several G-protein coupled receptor trans- 
membrane domains, said sockets being surrounded by 
hydrophobic residues of the G-protein coupled receptors. 
The hydrophilic side of each G-protein coupled receptor 
transmembrane helix is postulated to face inward and form 
a polar ligand. binding site. TM3 has been implicated in 
several G-protein coupled receptors . as having a ligand 
binding site, such as the TM3 aspartate residue. TM5 
serines, a TM6 asparagine and TM6 or TM7 phenylalanines 
or tyrosines are also implicated in ligand binding. 

G-protein coupled receptors can be intracellular ly 
coupled by heterotrimeric G-proteins to various intracellular 
enzymes, ion channels and transporters (see, Johnson et al., 
Endoc. Rev., 1989, 10:317-331). Different G-protein 
a-subunits preferentially stimulate particular effectors to 
modulate various biological functions in a cell. Phosphory- 
lation of cytoplasmic residues of G-protein coupled recep- 
tors has been identified as an important mechanism for the 
regulation of G-protein coupling of some G-protein coupled 
receptors. G-protein coupled receptors are found in numer- 
ous sites within a mammalian host. Over the past 15 years, 
nearly 350 therapeutic agents targeting 7 transmembrane (7 
TM) receptors have been successfully introduced onto the 
market. 

SUMMARY OF THE INVENTION 

The present invention relates to GPR25, in particular 
GPR25 polypeptides and GPR25 polynucleotides, recombi- 
nant materials and methods for their production. In another 
aspect, the invention relates to methods for using such 
polypeptides and polynucleotides, including the treatment of 
infections such as bacterial, fungal, protozoan and viral 
infections, particularly infections caused by HI V-l or HIV-2, 
pain; cancers; diabetes, obesity; anorexia; bulimia; asthma; 
Parkinson's disease; acute heart failure; hypotension; hyper- 
tension; urinary retention; osteoporosis; angina pectoris; 
myocardial infarction; stroke; ulcers; asthma; allergies; 
benign prostatic hypertrophy; migraine; vomiting; psychotic 
and neurological disorders, including anxiety, 
schizophrenia, manic depression, depression, delirium, 
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dementia, and severe mental retardation; and dyskinesias, Polypeptides of the present invention can be prepared m 

such as Huntington's disease or Gilles dela Tourett's any suitable manner. Such polypeptides include isolated 

syndrome, hereinafter referred to as "the Diseases", amongst naturally occurring polypeptides, recombinantly produced 

others. In a further aspect, the invention relates to methods polypeptides, synthetically produced polypeptides, or 

for identifying agonists and antagonists/inhibitors using the 5 polypeptides produced by a combination of these methods, 

materials provided by the invention, and treating conditions Means for preparing such polypeptides are well understood 

associated with GPR25 imbalance with the identified com- in the art. 

pounds. In a still further aspect, the invention relates to In a further aspect, the present invention relates to GPR25 

diagnostic assays for detecting diseases associated with polynucleotides. Such polynucleotides include isolated 

inappropriate GPR25 activity or levels. io polynucleotides comprising a nucleotide sequence encoding 

a polypeptide which has at least 70% identity, preferably at 

DESCRIPTION OF THE INVENTION least 80% idcn|ity> more pre ferably at least 90% identity, yet 

In a first aspect, the present invention relates to GPR25 more preferably at least 95% identity, to the amino acid 

polypeptides. Such peptides include isolated polypeptides sequence of SEQ ID NO:2, over the entire length of SEQ ID 

comprising an amino acid sequence which has at least 70% 15 NO:2. In this regard, polypeptides which have at least 97% 

identity, preferably at least 80% identity, more preferably at identity are highly preferred, whilst those with at least 

least 90% identity, yet more preferably at least 95% identity, 98-99% identity are more highly preferred, and those with 

most preferably at least 97-99% identity, to that of SEQ ID at least 99% identity are most highly preferred. Such poly- 

NO-2 over the entire length of SEQ ID NO:2. Such polypep- nucleotides include a polynucleotide comprising the nucle- 

tides include those comprising the amino acid of SEQ ID 20 otide sequence contained in SEQ ID NO:l encoding the 

N0;2> polypeptide of SEQ ID NO:2. 

Further peptides of the present invention include isolated Further polynucleotides of the present invention include 

polypeptides in which the amino acid sequence has at least isolated polynucleotides comprising a nucleotide sequence 

70% identity preferably at least 80% identity, more prefer- that has at least 70% identity, preferably at least 80% 

ably at least 90% identity, yet more preferably at least 95% 25 identity, more preferably at least 90% identity, yet more 

identity most preferably at least 97-99% identity, to the preferably at least 95% identity, to a nucleotide sequence 

amino acid sequence of SEQ ID NO:2 over the entire length encoding a polypeptide of SEQ ID NO:2, over the entire 

of SEQ ID NO:2. Such polypeptides include the polypeptide coding region. In this regard, polynucleotides which have at 

of SEQ ID NO*2 least 97% identit y are highly preferred, whilst those with at 

Further peptides of the present invention include isolated 30 ^ast 98-99% identity are more highly preferred, and those 

polypeptides encoded by a polynucleotide comprising the with at least 99% identity are most highly preferred, 

sequence contained in SEQ ID NO:l. further polynucleotides of the present invention mclude 

Polypeptides of the present invention are believed to be elated ^polynucleotides comprising a nucleotide sequence 

membersofmeG-prote P incoupled,7transmembranerecep. 35 « has at least 70% identity P^^y 80% 

tor gene family of polypeptides. ITiey are therefore of identity more preferably at least 90% identrty yet more 

intJLt becau se this pVeh sequence, which was obtained Preferably at least 95% identity, to SEQ ID NO:l ^over the 

by PCR amplification of spleen cDNA, has six amino acid entire length of SEQ ID NO:l. fo this regard polymicle- 

dkrences from the published sequence for GPR25 having otides which have at least *ntty are highly preferred, 

Ger^ankAccessionNo.U91939,whichpubtishedsequenc< ; „ whnst those with at ^J^^.™™,^ 

is believed to contain errors. These properties are hereinafter *° preferred and those with at leas 99% identity are most 

referred to as "GPR25 activity" or "GPR25 polypeptide highly preferred. Such polynucleotides include a Polynucle- 

activity" or "biological activity of GPR25". Also included °tide comprising the PpjynucleoUde of SEQ ID NO.l as 

amongst these activities are antigenic and immunogenic well as the polynucleotide of SEQ ID NO:l 

activities of said GPR25 polypeptides, in particular the 45 The invention also provides polynucleotides which are 

antigenic and immunogenic activities of the polypeptide of complementary to all the above described polynucleotides. 

SEQ ID NO:2. Preferably, a polypeptide of the present The nucleotide sequence of SEQ ID NO: 1 shows homol- 

invention exhibits at least one biological activity of GPR25. ogy with GPR25 (Jung, B. P. et al, Biochem. Biophys. Res. 

The polypeptides of the present invention may be in the Commun. 230 (1), 69-72 (1997)). The nucleotide sequence 

form of the "mature" protein or may be a part of a larger so of SEQ ID NO:l is a cDNA sequence and comprises a 

protein such as a fusion protein. It is often advantageous to polypeptide encoding sequence (nucleotides 79 to 1161) 

include an additional amino acid sequence which contains encoding a polypeptide of 361 amino acids, the polypeptide 

secretory or leader sequences, pro-sequences, sequences of SEQ ID NO:2. 

which aid in purification such as multiple histidine residues, The nucleotide sequence encoding the polypeptide of 

or an additional sequence for stability during recombinant 55 SEQ ID NO:2 may be identical to the polypeptide encoding 

production. sequence contained in SEQ ID NO:l or it may be a sequence 

The present invention also includes include variants of the other than the one contained in SEQ ID NO:l, which, as a 

aforementioned polypeptides, that is polypeptides that vary result of the redundancy (degeneracy) ofjhe genetic code, 

from the referents by conservative amino acid substitutions, also encodes the polypeptide of SEQ ID NO:2. The polypep- 

whereby a residue is substituted by another with like char- 60 tide of the SEQ ID NO:2 is structurally related to other 

acteristics. Typical such substitutions are among Ala, Val, proteins of the G-protein coupled, 7 transmembrane receptor 

Leu and He, among Ser and Thr; among the acidic residues gene family, having homology and/or structural similarity 

Asp and Glu; among Asn and Gin, and among the basic with GPR25 (Jung, B. P. et at, Biochem. Biophys. Res. 

residues Lys and Arg; or aromatic residues Phe and Tyr. Commun. 230(1), 69-72 (1997)). 

Particularly preferred are variants in which several, 5-10, 65 Preferred polypeptides and polynucleotides of the present 

1-5, 1-3, 1-2 or 1 amino acids are substituted, deleted, or invention are expected to have, inter alia, similar biological 

added in any combination. functions/properties to their homologous polypeptides and 
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polynucleotides. Furthermore, preferred polypeptides and dextran sulfate, and 20 microgram/ml denatured, sheared 

polynucleotides of the present invention have at least one salmon sperm DNA; followed by washing the filters in O.lx 

GPR25 activity ssc at about 65 ° c - the P resent invention also includes 

Polynucleotidesofthep^ {^^^^ 

using standard c bning and ^reemng echmques, from a ^ ^ ^ ^ UCQC0 of SEQ ID N0 :l or a 

cDNA library derived from mRNA in cells of human Spleen, fragment thereof 

using the expressed f^^J^J^J>J^^ < A ^ an ? ' . The skilled artisan will appreciate that, in many cases, an 

M. D., et al. Science (1991) 252:1651-1656; Adams, M. D. Elated cDNA sequence will be incomplete, in that the 

et al., Nature, (1992) 355:632-634; Adams, M. D., et al., reg ion coding for the polypeptide is cut short at the 5 ( end of 

Nature (1995) 377 Supp:3-174). Polynucleotides of the w ^ e C DNA. This is a consequence of reverse transcriptase, an 

invention can also be obtained from natural sources such as enzyme with inherently low 'processivity' (a measure of the 

genomic DNA libraries or can be synthesized using well ability of the enzyme to remain attached to the template 

known and commercially available techniques. during the polymerisation reaction), failing to complete a 

When polynucleotides of the present invention are used DNA copy of the mRNA template during 1st strand cDNA 

for the recombinant production of polypeptides of the 15 synthesis. 

present invention, the polynucleotide may include the cod- There are several methods available and well known to 

ing sequence for the mature polypeptide, by itself; or the those skilled in the art to obtain full-length cDNAs, or 

coding sequence for the mature polypeptide in reading frame extend short cDNAs, for example those based on the method 

withothercodingsequences,suchasthoseencodingaleader of Rapid Amplification of cDNA ends <^Hse e , for 

or secretory sequence, a pre-, or pro- or prepro- protein 20 example, Frohman et al., PNAS USA 85, 8998-9002, 1988). 

sequence, or other fusion peptide portions. For example, a R/cent modifications of the technique, exemplified by the 

marker sequence which f abates purification of the fused Marathon™ technology (Clontech 

. T? , , , . _ . • ^c^a ^u^a; example, have significantly simplified the search for longer 

polypeptide can be encoded. In certain preferred embodi- c dnAs. In the Marathon™ technology, cDNAs have been 

ments of this aspect of the invention the marker sequence ; is is d from mRNAextracted from a chosen ^ and M 

a hexa-histidme peptide, as provided in the pQE vector < adaptor * sequence ligated onto each end. Nucleic acid 

(Qiagen, Inc.) and described in GcnizetaL,Proc Natl Acad amplification (p CR ) is lheri carried out to amplify the 

Sci USA (1989) 86:821-824, or is an HA tag. The poly- 'm^ing' $< cn d of the cDNA using a combination of gene 

nucleotide may also contain non-coding 5 f and 3' sequences, specific and adaptor specific oligonucleotide primers. The 

such as transcribed, non-translated sequences, splicing and ^ pc R rea ction is then repeated using 'nested* primers, that is, 

polyadenylation signals, ribosome binding sites and primers designed to anneal within the amplified product 

sequences that stabilize mRNA. (typically an adaptor specific primer that anneals further 3' 

further embodiments of the present invention include in the adaptor sequence and a gene specific primer that 
polynucleotides encoding polypeptide variants which com- anneals further 5' in the known gene sequence). The prod- 
prise the amino acid sequence of SEQ ID NO:2 and in which 35 ucts of this reaction can then be analysed by DNA sequenc- 
several, for instance from 5 to 10, 1 to 5, 1 to 3, 1 to 2 or 1, ing and a full-length cDNA constructed either by joining the 
amino acid residues are substituted, deleted or added, in any product directly to the existing cDNA to give a complete 
combination. sequence, or carrying out a separate full-length PCR using 

Polynucleotides which are identical or sufficiently iden- the new sequence information for the design of the 5| primer, 

tical to a nucleotide sequence contained in SEQ ID NO:l, 40 Recombinant polypeptides of the present invention may 

may be used as hybridization probes for cDNA and genomic be prepared by processes well known in the art from 

DNA or as primers for a nucleic acid amplification (PCR) genetically engineered host cells comprising expression 

reaction, to isolate full-length cDNAs and genomic clones systems. Accordingly, in a further aspect, the present inven- 

encoding polypeptides of the present invention and to isolate tion relates to expression systems which comprise a" poly- 

cDNA and genomic clones of other genes (including genes 45 nucleotide or polynucleotides of the present invention, to 

encoding homologs and orthologs from species other than host cells which are genetically engineered with such 

human) that have a high sequence similarity to SEQ ID expression sytems and to the production of polypeptides of 

NO:l. Typically these nucleotide sequences are 70% the invention by recombinant techniques. Cell-free transla- 

identical, preferably 80% identical, more preferably 90% tion systems can also be employed to produce such proteins 

identical, most preferably 95% identical to that of the 50 using RNAs derived from the DNA constructs of the present 

referent. The probes or primers will generally comprise at invention. 

least 15 nucleotides, preferably, at least 30 nucleotides and For recombinant production, host cells can be genetically , 

may have at least 50 nucleotides. Particularly preferred engineered to incorporate expression systems or portions 

probes will have between 30 and 50 nucleotides. thereof for polynucleotides of the present invention. Intro- 

A polynucleotide encoding a polypeptide of the present 55 duction of polynucleotides into host cells can be effected by 

invention, including homologs and orthologs from species methods described in many standard laboratory manuals, 

other than human, may be obtained by a process which such as Davis et al, Basic Methods in Molecular Biology 

comprises the steps of screening an appropriate library under (1986) and Sambrook et al., Molecular Cloning: A Labora- 

stringent hybridization conditions with a labeled probe hav- tory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, 

ing the sequence of SEQ ID NO:l or a fragment thereof; and 60 Cold Spring Harbor, N.Y (1989). Preferred such methods 

isolating full-length cDNA and genomic clones containing include, for instance, calcium phosphate transfection, 

said polynucleotide sequence. Such hybridization tech- DEAE-dextran mediated transfection, transvection, 

niques are well known to the skilled artisan. Preferred microinjection, cationic lipid-mediated transfection, 

stringent hybridization conditions include overnight incuba- electroporation, transduction, scrape loading, ballistic intro- 

tion at 42° C. in a solution comprising: 50% formamide, 65 duction or infection. 

5xSSC (150 mM Nad, 15 mM trisodium citrate), 50 mM Representative examples of appropriate hosts include 

sodium phosphate (pH7.6), 5x Denhardt's solution, 10% bacterial cells, such as streptococci, staphylococci, E. coU, 
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Streptomyces and Bacillus subtilis cells; fungal cells, such as Deletions and insertions can be detected by a change in size 

yeast cells and Aspergillus cells; insect cells such as Droso- of the amplified product in comparison to the normal geno- 

phila S2 and Spodoptera Sf9 cells; animal cells such as type. Point mutations can be identified by hybridizing ampU- 

CHO, COS, HeLa, C127, 3T3, BHK, HEK 293 and Bowes fied DNA to labeled GPR25 nucleotide sequences. Perfectly 

~n c . /nri niant »l « 5 matched sequences can be distmguished from mismatched 

melanoma cells, and plant cells. 5 ^ ^ ^ ^ {n mehing 

A great variety of expression systems can be used, tor temperatures . DNA sequence differences may also be 

instance, chromosomal, episomal and virus-derived detected by alterations in electrophoretic mobility of DNA 

systems, e.g., vectors derived from bacterial plasmids, from fragments in gels, with or without denaturing agents, or by 

bacteriophage, from transposons, from yeast episomes, from ^ cci sequencing (ee, e.g., Myers et al., Science 

insertion elements, from yeast chromosomal elements, from 30 (1985)230:1242). Sequence changes at specific locations 

viruses such as baculoviruses, papova viruses, such as mav a is 0 be revealed by nuclease protection assays, such as 

SV40, vaccinia viruses, adenoviruses, fowl pox viruses, RNase and SI protection or the chemical cleavage method 

pseudorabies viruses and retroviruses, and vectors derived ( se e Cotton et al., Proc Natl Acad Sci USA (1985) 

from combinations thereof, such as those derived from 85:4397-4401). In another embodiment, an array of oligo- 

plasmid and bacteriophage genetic elements, such as 15 nucleotides probes comprising GPR25 nucleotide sequence 

cosmids and phagemids. The expression systems may con- or fragments thereof can be constructed to conduct efficient 

tain control regions that regulate as well as engender expres- screening of e.g., genetic mutations. Array technology meth- 

sion. Generally, any system or vector which is able to 0 ds are well known and have general applicability and can 

maintain, propagate or express a polynucleotide to produce De use d to address a variety of questions in molecular 

a polypeptide in a host may be used. The appropriate 20 genetics including gene expression, genetic linkage, and 

nucleotide sequence may be inserted into an expression genetic variability (see for example: M. Chee et al., Science, 

system by any of a variety of well-known and routine Vol 274, pp 610-613 (1996)). 

techniques, such as, for example, those set forth in Sam- diagnostic assays offer a process for diagnosing or 

brook et al., MOLECULAR CLONING, A LABORATORY determining a susceptibility to the Diseases through detec- 

MANUAL (supra). Appropriate secretion signals may be 25 { { on 0 f mu tation in the GPR25 gene by the methods 

incorporated into the desired polypeptide to allow secretion described. In addition, such diseases may be diagnosed by 

of the translated protein into the lumen of the endoplasmic methods comprising determining from a sample derived 

reticulum, the periplasmic space or the extracellular envi- g. om a subject an abnormally decreased or increased level of 

ronment. These signals may be endogenous to the polypep- polypeptide or mRNA. Decreased or increased expression 

tide or they may be heterologous signals, 30 can be measured at the RNA level using any of the methods 

If a polypeptide of the present invention is to be expressed well known in the art for the quantitation of polynucleotides, 

for use in screening assays, it is generally preferred that the such as, for example, nucleic acid amplification, for instance 

polypeptide be produced at the surface of the cell. In this PCR, RT-PCR, RNase protection, Northern blotting and 

event, the cells may be harvested prior to use in the other hybridization methods. Assay techniques that can be 

screening assay. If the polypeptide is secreted into the 35 used to determine levels of a protein, such as a polypeptide 

medium, the medium can be recovered in order to recover of the present invention, in a sample derived from a host are 

and purify the polypeptide. If produced intracellularly, the well-known to those of skill in the art. Such assay methods 

cells must first be lysed before the polypeptide is recovered. include radioimmunoassays, competitive-binding assays, 

Polypeptides of the present invention can be recovered ^ Western Blot analysis and ELISA assays, 

and purified from recombinant cell cultures by well-known Thus in another aspect, the present invention relates to a 

methods including ammonium sulfate or ethanol diagonostic kit which comprises: 

precipitation, acid extraction, anion or cation exchange (a) a polynucleotide of the present invention, preferably 

chromatography, phosphocellulose chromatography, hydro- the nucleotide sequence of SEQ ID NO: 1, or a fragment 

phobic interaction chromatography, affinity 45 thereof; 

chromatography, hydroxylapatite chromatography and lec- (fc>) a nucleotide sequence complementary to that of (a); 

tin chromatography. Most preferably, high performance liq- ( c ) a polypeptide of the present invention, preferably the 

uid chromatography is employed for purification. Well polypeptide of SEQ ID NO:2 or a fragment thereof; or 

known techniques for refolding proteins may be employed ^ an ant foody to a polypeptide of the present invention, 

to regenerate active conformation when the polypeptide is 5Q preferably to the polypeptide of SEQ ID NO:2. 

denatured during isolation and or purification. j t ^ ^ a pp rec iated that in any such kit, (a), (b), (c) or 

This invention also relates to the use of polynucleotides of (d) may comprise a substantial component. Such a kit will 
the present invention as diagnostic reagents. Detection of a De of use in diagnosing a disease or suspectability to a 
mutated form of the gene characterised by the polynucle- disease, particularly infections such as bacterial, fungal, 
otide of SEQ ID NO:l which is associated with a dysfunc- 55 protozoan and viral infections, particularly infections caused 
lion will provide a diagnostic tool that can add to, or define, by HIV-1 or HIV-2; pain; cancers; diabetes, obesity; anor- 
a diagnosis of a disease, or susceptibility to a disease, which exia; bulimia; asthma; Parkinson's disease; acute heart fail- 
results from under-expression, over-expression or altered ure; hypotension; hypertension; urinary retention; 
expression of the gene. Individuals carrying mutations in the osteoporosis; angina pectoris; myocardial infarction; stroke; 
gene may be detected at the DNA level by a variety of 60 ulcers; asthma; allergies; benign prostatic hypertrophy; 
techniques. migraine; vomiting; psychotic and neurological disorders, 

Nucleic acids for diagnosis may be obtained from a including anxiety, schizophrenia, manic depression, 

subject's cells, such as from blood, urine, saliva, tissue depression, delirium, dementia, and severe mental retarda- 

biopsy or autopsy material. The genomic DNA may be used tion; and dyskinesias, such as Huntington's disease or Gilles 

directly for detection or may be amplified enzymatically by 65 dela Tourett's syndrome, amongst others, 

using PCR or other amplification techniques prior to analy- The nucleotide sequences of the present invention are also 

sis. RNA or cDNA may also be used in similar fashion. valuable for chromosome identification. The sequence is 
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specifically targeted to, and can hybridize with, a particular 
location on an individual human chromosome. The mapping 
of relevant sequences to chromosomes according to the 
present invention is an important first step in correlating 
those sequences with gene associated disease. Once a 
sequence has been mapped to a precise chromosomal 
location, the physical position of the sequence on the chro- 
mosome can be correlated with genetic map data. Such data 
are found in, for example, V. McKusick Mendelian Inher- 
itance in Man (available on-line through Johns Hopkins 
University Welch Medical Library). The relationship 
between genes and diseases that have been mapped to the 
same chromosomal region are then identified through link- 
age analysis (coinheritance of physically adjacent genes). 

The differences in the cDNA or genomic sequence 
between affected and unaffected individuals can also be 
determined. If a mutation is observed in some or all of the 
affected individuals but not in any normal individuals, then 
the mutation is likely to be the causative agent of the disease. 

The polypeptides of the invention or their fragments or 
analogs thereof, or cells expressing them, can also be used 
as immunogens to produce antibodies immunospecific for 
polypeptides of the present invention. Hie term "immuno- 
specific" means that the antibodies have substantially greater 
affinity for the polypeptides of the invention than their 
affinity for other related polypeptides in the prior art. 

Antibodies generated against polypeptides of the present 
invention may be obtained by administering the polypep- 
tides or epitope-bearing fragments, analogs or cells to an 
animal, preferably a non-human animal, using routine pro- 
tocols. For preparation of monoclonal antibodies, any tech- 
nique which provides antibodies produced by continuous 
cell line cultures can be used. Examples include the hybri- 
doma technique (Kohler, G. and Milstein, C, Nature (1975) 
256:495-497), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al., Immunology Today 35 
(1983) 4:72) and the EBV-hybridoma technique (Cole et al., 
MONOCLONAL ANTIBODIES AND CANCER 
THERAPY, pp. 77-96, Alan R. Liss, Inc., 1985). 

Techniques for the production of single chain antibodies, 
such as those described in U.S. Pat. No. 4,946,778, can also 
be adapted to produce single chain antibodies to polypep- 
tides of this invention. Also, transgenic mice, or other 
organisms, including other mammals, may be used to 
express humanized antibodies. 

The above-described antibodies may be employed to 
isolate or to identify clones expressing the polypeptide or to 
purify the polypeptides by affinity chromatography. 

Antibodies against polypeptides of the present invention 
may also be employed to treat the Diseases, amongst others. 

In a further aspect, the present invention relates to geneti- 
cally engineered soluble fusion proteins comprising a 
polypeptide of the present invention, or a fragment thereof, 
and various portions of the constant regions of heavy or light 
chains of immunoglobulins of various subclasses (IgG, IgM, 
IgA, IgE). Preferred as an immunoglobulins is the constant 
part of the heavy chain of human IgG, particularly IgGl, 
where fusion takes place at the hinge region. In a particular 
embodiment, the Fc part can be removed simply by incor- 
poration of a cleavage sequence which can be cleaved with 
blood clotting factor Xa. Furthermore, this invention relates 
to processes for the preparation of these fusion proteins by 
genetic engineering, and to the use thereof for drug 
screening, diagnosis and therapy. A further aspect of the 
invention also relates to polynucleotides encoding such 
fusion proteins. Examples of fusion protein technology can 
be found in International Patent Application Nos. W094/ 
29458 and W094/22914. 



Another aspect of the invention relates to a method for 
inducing an immunological response in a mammal which 
comprises inoculating the mammal with a polypeptide of the 
present invention, adequate to produce antibody and/or T 
cell immune response to protect said animal from the 
Diseases hereinbefore mentioned, amongst others. Yet 
another aspect of the invention relates to a method of 
inducing immunological response in a mammal which 
comprises, delivering a polypeptide of the present invention 
via a vector directing expression of the polynucleotide and 
coding for the polypeptide in vivo in order to induce such an 
immunological response to produce antibody to protect said 
animal from diseases. 

A further aspect of the invention relates to an 
immunological/vaccine formulation (composition) which, 
when introduced into a mammalian host, induces an immu- 
nological response in that mammal to a polypeptide of the 
present invention wherein the composition comprises a 
polypeptide or polynucleotide of the present invention. The 
20 vaccine formulation may further comprise a suitable carrier. 
Since a polypeptide may be broken down in the stomach, it 
is preferably administered parenterally (for instance, 
subcutaneous, intramuscular, intravenous, or intradermal 
injection). Formulations suitable for parenteral administra- 
tion include aqueous and non-aqueous sterile injection solu- 
tions which may contain antioxidants, buffers, bacteriostats 
and solutes which render the formulation instonic with the 
blood of the recipient; and aqueous and non-aqueous sterile 
suspensions which may include suspending agents or thick- 
ening agents. The formulations may be presented in unit- 
dose or multi-dose containers, for example, sealed ampoules 
and vials and may be stored in a freeze-dried condition 
requiring only the addition of the sterile liquid carrier 
immediately prior to use. The vaccine formulation may also 
include adjuvant systems for enhancing the immunogenicity 
of the formulation, such as oil-in water systems and other 
systems known in the art. The dosage will depend on the 
specific activity of the vaccine and can be readily deter- 
mined by routine experimentation. 

Polypeptides of the present invention are responsible for 
many biological functions, including many disease states, in 
particular the Diseases hereinbefore mentioned. It is there- 
fore desirous to devise screening methods to identify com- 
pounds which stimulate or which inhibit the function of the 
polypeptide. Accordingly, in a further aspect, the present 
invention provides for a method of screening compounds to 
identify those which stimulate or which inhibit the function 
of the polypeptide. In general, agonists or antagonists may 
be employed for therapeutic and prophylactic purposes for 
such Diseases as hereinbefore mentioned. Compounds may 
be identified from a variety of sources, for example, cells, 
cell-free preparations, chemical libraries, and natural prod- 
uct mixtures. Such agonists, antagonists or inhibitors 
so-identified may be natural or modified substrates, ligands, 
receptors, enzymes, etc., as the case may be, of the polypep- 
tide; or may be structural or functional mimetics thereof (see 
Coligan et al., Current Protocols in Immunology 
l(2):Chapter 5 (1991)). 

The screening method may simply measure the binding of. 
a candidate compound to the polypeptide, or to cells or 
membranes bearing the polypeptide, or a fusion protein 
thereof by means of a label directly or indirectly associated 
with the candidate compound. Alternatively, the screening 
method may involve competition with a labeled competitor. 
Further, these screening methods may test whether the 
candidate compound results in a signal generated by acti- 
vation or inhibition of the polypeptide, using detection 
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systems appropriate to the cells bearing the polypeptide. It will be appreciated that in any such kit, (a), (b), (c) or 

Inhibitors of activation are generally assayed in the presence (d) may comprise a substantial component. 

of a known agonist and the effect on activation by the It will be readily appreciated by the skilled artisan that a 

agonist by the presence of the candidate compound is polypeptide of the present invention may also be used in a 

observed. Constitutively active polypeptides may be 5 method for the structure-based design of an agonist, antago- 

employed in screening methods for inverse agonists or nist or inhibitor of the polypeptide, by: 

inhibitors, in the absence of an agonist or inhibitor, by (a) determining in the first instance the three-dimensional 

testing whether the candidate compound results in inhibition structure of the polypeptide; 

of activation of the polypeptide. Further, the screening ( b ) deducing the three-dimensional structure for the likely 
methods may simply comprise the steps of mixing a candi- 1Q reactive or binding site(s) of an agonist, antagonist or 
date compound with a solution containing a polypeptide of inhibitor; 

the present invention, to form a mixture, measuring GPR25 ( c ) S ynthesing candidate compounds that are predicted to 
activity in the mixture, and comparing the GPR25 activity of bind to or react ^ the deduced binding or reactive 

the mixture to a standard. Fusion proteins, such as those s | te; ^ 

made from Fc portion and GPR25 polypeptide, as herein- J5 (d) te J ting whether ^ cand i d ate compounds are indeed 
before described, can also be used for high-throughput agonists, antagonists or inhibitors, 

screening assays to identify antagonists for the polypeptide n ^ be furthef appreciated tnat mis normally be an 

of the present invention (see D. Bennett et al., J Mol iterative process. 

Recognition, 8:52-58 (1995); and K. Johanson et al, J Biol In a the present inveQtion provides meth- 

Chem, 270(16):9459-9471 (1995)). 2Q ods of treaung abnormal conditions such as, for instance, 

The polynucleotides, polypeptides and antibodies to the m f ections such ^ bacterial, fungal, protozoan and viral 

polypeptide of the present invention may also be used to infec tions, particularly infections caused by HIV-1 or HIV- 

configure screening methods for detecting the effect of 2; pain; sticks; diabetes, obesity; anorexia; bulimia; 

added compounds on the production of mRNAand polypep- asthma; Parkinson > s disease; acute heart failure; hypoten- 

tide in cells. For example, an ELISA assay may be con- 25 sion; hypertension; retention; osteoporosis; angina 

structed for measuring secreted or cell associated levels of pectoris; myocardial infarction; stroke; ulcers; asthma; aller- 

polypeptide using monoclonal and polyclonal antibodies by gies; benigQ prostatic hypertrophy; migraine; vomiting; psy- 

standard methods known in the art. This can be used to chotic and neU rological disorders, including anxiety, 

discover agents which may inhibit or enhance the production schizophrenia, manic depression, depression, delirium, 

of polypeptide (also called antagonist or agonist, 3(J demen tia, ^ se vere mental retardation; and dyskinesias, 

respectively) from suitably manipulated cells or tissues. such as Huntington ' s disease or G illes dela Tourett's 

The polypeptide may be used to identify membrane bound syndrome , rc i ated t0 eit h e r an excess of, or an under- 
or soluble receptors, if any, through standard receptor bind- expression of, GPR25 polypeptide activity, 
ing techniques known in the art. These include, but are not lf the activity of the polypeptide is in excess, several 
limited to, ligand binding and crosslinking assays in which 35 a p proac hes are available. One approach comprises admin- 
the polypeptide is labeled with a radioactive isotope (for ^ring to a subject in need thereof an inhibitor compound 
instance, I), chemically modified (for instance, ( aDt agonist) as hereinabove described, optionaUy in combi- 
biotinylated), or fused to a peptide sequence suitable for nation ^ a pharmaceutical^ acceptable carrier, in an 
detection or purification, and incubated with a source of the amount effective to inhibit the faction of the polypeptide, 
putative receptor (cells, cell membranes, cell supernatants, ^ such agj for example> oy blocking the binding of ligands, 
tissue extracts, bodily fluids). Other methods include bio- substrates, receptors, enzymes, etc., or by inhibiting a sec- 
physical techniques such as surface plasmon resonance and Qnd signal> and mereby the abnormal condition, 
spectroscopy. These screening methods may also be used to Tq anolhef approach , soluble forms of the polypeptides still 
identify agonists and antagonists of the polypeptide which capable of binding the ligand> substrate , enzymes, receptors, 
compete with the binding of the polypeptide to its receptors, 45 etc jn compelition ^th endogenous polypeptide may be 
if any. Standard methods for conducting such assays are well adm inistered. Typical examples of such competitors include 
understood in the art. fragments of the GPR25 polypeptide. 

Examples of potential polypeptide antagonists include , Q stfu aQOther approach> expression of the gene encoding 

antibodies or, in some cases, oligonucleotides or proteins endogenous GPR25 polypeptide can be inhibited using 

which are closely related to the ligands, substrates, 5Q expression b i ock in g techniques. Known such techniques 
receptors, enzymes, etc., as the case may be, of the (he ^ of ntiseasc sequences, either internally 

polypeptide, e.g., a fragment of the ligands, substrates, generat ed or separately administered (see, for example, 

receptors, enzymes, etc.; or small molecules which bind to O'Connor, JNeurochem (1991) 56:560 in Oligodeoxynucle- 

the polypeptide of the present invention but do not elicit a otides as Mi ^ nsc inhibitors of Gene Expression, CRC 

response, so that the activity of the polypeptide is prevented. 55 Pfess> Boca Rat0D) ^ ( 198 g)). Alternatively, oligonucle- 

Thus, in another aspect, the present invention relates to a otkJes which form triple helices ^ thc gene can be 

screening kit for identifying agonists, antagonists, ligands, supplied (see, for example, Lee et al., Nucleic Acids Res 

receptors, substrates, enzymes, etc. for polypeptides of the (1979) 6:3m . CooQey et aL> Science (1988)241:456; Der- 

present invention; or compounds which decrease or enhance van et al Science (1991)251:1360). These oligomers can be 

the production of such polypeptides, which comprises: fi(J admin istered per se or the relevant oligomers can be 

(a) a polypeptide of the present invention; expressed in vivo. 

(b) a recombinant cell expressing a polypeptide of the For treating abnormal conditions related to an under- 
present invention; expression of GPR25 and its activity, several approaches are 

(c) a cell membrane expressing a polypeptide of the also available. One approach comprises administering to a 
present invention; or 65 subject a therapeutically effective amount of a compound 

(d) antibody to a polypeptide of the present invention; which activates a polypeptide of the present invention, i.e., 
which polypeptide is preferably that of SEQ ID NO:2. an agonist as described above, in combination with a phar- 
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maceutically acceptable carrier, to thereby alleviate the 
abnormal condition. Alternatively, gene therapy may be 
employed to effect the endogenous production of GPR25 by 
the relevant cells in the subject. For example, a polynucle- 
otide of the invention may be engineered for expression in 5 
a replication defective retroviral vector, as discussed above. 
The retroviral expression construct may then be isolated and 
introduced into a packaging cell transduced with a retroviral 
plasmid vector containing RNA encoding a polypeptide of 
the present invention such that the packaging cell now 10 
produces infectious viral particles containing the gene of 
interest. These producer cells may be administered to a 
subject for engineering cells in vivo and expression of the 
polypeptide in vivo. For an overview of gene therapy, see 
Chapter 20, Gene Therapy and other Molecular Genetic- 15 
based Therapeutic Approaches, (and references cited 
therein) in Human Molecular Genetics, T Strachan and A P 
Read, BIOS Scientific Publishers Ltd (1996). Another 
approach is to administer a therapeutic amount of a polypep- 
tide of the present invention in combination with a suitable 20 
pharmaceutical carrier. 

In a further aspect, the present invention provides for 
pharmaceutical compositions comprising a therapeutically 
effective amount of a polypeptide, such as the soluble form 
of a polypeptide of the present invention, agonist/antagonist 25 
peptide or small molecule compound, in combination with a 
pharmaceutically acceptable carrier or excipient. Such car- 
riers include, but are not limited to, saline, buffered saline, 
dextrose, water, glycerol, ethanol, and combinations thereof. 
The invention further relates to pharmaceutical packs and 30 
kits comprising one or more containers filled with one or 
more of the ingredients of the aforementioned compositions 
of the invention. Polypeptides and other compounds of the 
present invention may be employed alone or in conjunction 
with other compounds, such as therapeutic compounds. 35 

The composition will be adapted to the route of 
administration, for instance by a systemic or an oral route. 
Preferred forms of systemic administration include 
injection, typically by intravenous injection. Other injection 
routes, such as subcutaneous, intramuscular, or 40 
intraperitoneal, can be used. Alternative means for systemic 
administration include transmucosal and transdermal admin- 
istration using penetrants such as bile salts or fusidic acids 
or other detergents. In addition, if a polypeptide or other 
compounds of the present invention can be formulated in an 45 
enteric or an encapsulated formulation, oral administration 
may also be possible. Administration of these compounds 
may also be topical and/or localized, in the form of salves, 
pastes, gels, and the like. 

The dosage range required depends on the choice of 50 
peptide or other compounds of the present invention, the 
route of administration, the nature of the formulation, the 
nature of the subject's condition, and the judgment of the 
attending practitioner. Suitable dosages, however; are in the 
range of 0.1-100 /*g/kg of subject. Wide variations in the 55 
needed dosage, however, are to be expected in view of the 
variety of compounds available and the differing efficiencies 
of various routes of administration. For example, oral 
administration would be expected to require higher dosages 
than administration by intravenous injection. Variations in 60 
these dosage levels can be adjusted using standard empirical 
routines for optimization, as is well understood in the art. 

Polypeptides used in treatment can also be generated 
endogenously in the subject, in treatment modalities often 
referred to as "gene therapy" as described above. Thus, for 65 
example, cells from a subject may be engineered with a 
polynucleotide, such as a DNA or RNA, to encode a 



polypeptide ex vivo, and for example, by the use of a 
retroviral plasmid vector. The cells are then introduced into 
the subject. 

Polynucleotide and polypeptide sequences form a valu- 
able information resource with which to identify further 
sequences of similar homology. This is most easily facili- 
tated by storing the sequence in a computer readable 
medium and then using the stored data to search a sequence 
database using well known searching tools, such as GCC. 
Accordingly, in a further aspect, the present invention pro- 
vides for a computer readable medium having stored thereon 
a polynucleotide comprising the sequence of SEQ ID NO:l 
and/or a polypeptide sequence encoded thereby. 

The following definitions are provided to facilitate under- 
standing of certain terms used frequently hereinbefore. 

"Antibodies" as used herein includes polyclonal and 
monoclonal antibodies, chimeric, single chain, and human- 
ized antibodies, as well as Fab fragments, including the 
products of an Fab or other immunoglobulin expression 
library. 

"Isolated" means altered "by the hand of man" from the 
natural state. If an "isolated" composition or substance 
occurs in nature, it has been changed or removed from its 
original environment, or both. For example, a polynucle- 
otide or a polypeptide naturally present in a living animal is 
not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is 
"isolated", as the term is employed herein. 

"Polynucleotide" generally refers to any polyribonucle- 
otide or polydeoxribonucleotide, which may be unmodified 
RNA or DNA or modified RNA or DNA. "Polynucleotides" 
include, without limitation, single- and double-stranded . 
DNA, DNA that is a mixture of single- and double-stranded 
regions, single- and double-stranded RNA, and RNA that is 
mixture of single- and double-stranded regions, hybrid mol- 
ecules comprising DNA and RNA that may be single- 
stranded or, more typically, double-stranded or a mixture of 
single- and double-stranded regions. In addition, "poly- 
nucleotide" refers to triple-stranded regions comprising 
RNA or DNA or both RNA and DNA. The term "polynucle- 
otide" also includes DNAs or RNAs containing one or more 
modified bases and DNAs or RNAs with backbones modi- 
fied for stability or for other reasons. "Modified" bases 
include, for example, tritylated bases and unusual bases such 
as inosine. A variety of modifications may be made to DNA 
and RNA; thus, "polynucleotide" embraces chemically, 
enzymatically or metabolically modified forms of poly- 
nucleotides as typically found in nature, as well as the 
chemical forms of DNA and RNA characteristic of viruses 
and cells. "Polynucleotide" also embraces relatively short 
polynucleotides, often referred to as oligonucleotides. 

"Polypeptide" refers to any peptide or protein comprising 
two or more amino acids joined to each other by peptide 
bonds or modified peptide bonds, i.e., peptide isosteres. 
"Polypeptide" refers to both short chains, commonly 
referred to as peptides, oligopeptides or oligomers, and to 
longer chains, generally referred to as proteins. Polypeptides 
may contain amino acids other than the 20 gene-encoded 
amino acids. "Polypeptides" include amino acid sequences 
modified either by natural processes, such as post- 
translational processing, or by chemical modification tech- 
niques which are well known in the art. Such modifications 
are well described in basic texts and in more detailed 
monographs, as well as in a voluminous research literature. 
Modifications may occur anywhere in a polypeptide, includ- 
ing the peptide backbone, the amino acid side-chains and the 
amino or carboxyl termini. It will be appreciated that the 
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same type of modification may be present to the same or PART I, Griffin, A. M, and Griffin, H. G., eds., Humana 
varying degrees at several sites in a given polypeptide. Also, Press, New Jersey, 1994; SEQUENCE ANALYSIS IN 
a given polypeptide may contain many types of modifica- MOLECULAR BIOLOGY, von Hcinje, G., Academic 
tions. Polypeptides may be branched as a result of Press, 1987; and SEQUENCE ANALYSIS PRIMER, 
ubiquitination, and they may be cyclic, with or without 5 Gribskov, M. and Devereux, J., eds., M Stockton Press, New 
branching. Cyclic, branched and branched cyclic polypep- York, 1991). While there exist a number of methods to 
tides may result from post-translation natural processes or measure identity between two polynucleotide or polypeptide 
may be made by synthetic methods. Modifications include sequences, the term "identity" is well known to skilled 
acetylation, acylation, ADP-ribosylation, amidation, cova- artisans (Carillo, H., and Lipton, D., SIAM J Applied Math 
lent attachment of flavin, covalent attachment of a heme 10 (1988) 48:1073). Methods commonly employed to deter- 
moiety, covalent attachment of a nucleotide or nucleotide mine identity or similarity between two sequences include, 
derivative, covalent attachment of a lipid or lipid derivative, but are not limited to, those disclosed in Guide to Huge 
covalent attachment of phosphotidylinositol, cross-linking, Computers, Martin J. Bishop, ed., Academic Press, San 
cyclization, disulfide bond formation, demethylation, for- Diego, 1994, and Carillo, H., and Lipton, D., SIAM J 
mation of covalent cross-links, formation of cystine, forma- 15 Applied Math (1988) 48:1073. Methods to determine iden- 
tion of pyroglutamate, formylation, gamma-carboxylation, tity and similarity are codified in computer programs. Pre- 
glycosylation, GPI anchor formation, hydroxylation, ferred computer program methods to determine identity and 
iodination, methylation, myristoylation, oxidation, pro- similarity between two sequences include, but are not lim- 
teolytic processing, phosphorylation, prenylation, ited to, GCG program package (Devereux, J., et al., Nucleic 
racemization, selenoylation, sulfation, transfer-RNA medi- 20 Acids Research (1984) 12(1):387), BLASTP, BLASTN, and 
ated addition of amino acids to proteins such as arginylation, FASTA (Atschul, S. F. et al.,/ MolecBiol (1990) 215:403). 
and ubiquitination (see, for instance, PROTEINS— By way of example, a polynucleotide sequence of the 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd present invention may be identical to the reference sequence 
Ed., T. E. Creighton, W. H. Freeman and Company, New of SEQ ID NO:l, that is be 100% identical, or it may include 
York, 1993; Wold, E, Post-translational Protein Modifica- 25 up to a certain integer number of nucleotide alterations as 
tions: Perspectives and Prospects, pgs. 1-12 in POST- compared to the reference sequence. Such alterations are 
TRANSLATIONAL COVALENT MODIFICATION OF selected from the group consisting of at least one nucleotide 
PROTEINS, B. C. Johnson, Ed., Academic Press, New deletion, substitution, including transition and transversion, 
York, 1983;' Seifter et al., "Analysis for protein modifica- or insertion, and wherein said alterations may occur at the 5 f 
tions and nonprotein cofactors", Meth Ertzymol (1990) 30 or 3' terminal positions of the reference nucleotide sequence 
182:626-646 and Rattan et al, "Protein Synthesis: Post- or anywhere between those terminal positions, interspersed 
translational Modifications and Aging", Arm NY Acad Set either individually among the nucleotides in the reference 
(1992) 663:48-62). sequence or in one or more contiguous groups within the 
"Variant" refers to a polynucleotide or polypeptide that reference sequence. The number of nucleotide alterations is 
differs from a reference polynucleotide or polypeptide, but 35 determined by multiplying the total number of nucleotides in 
retains essential properties. A typical variant of a polynucle- SEQ ID NO:l by the numerical percent of the respective 
otide differs in nucleotide sequence from another, reference percent identity (divided by 100) and subtracting that prod- 
poly nucleotide. Changes in the nucleotide sequence of the uct from said total number of nucleotides in SEQ ID NO:l, 
variant may or may not alter the amino acid sequence of a or: 
polypeptide encoded by the reference polynucleotide. 40 n ^ x _^ ^ 
Nucleotide changes may result in amino acid substitutions, " 

additions, deletions, fusions and truncations in the polypep- wherein n„ is the number of nucleotide alterations, is the 

tide encoded by the reference sequence, as discussed below. total number of nucleotides in SEQ ID NO:l, and y is 0.50 

A typical variant of a polypeptide differs in amino acid for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 

sequence from another, reference polypeptide. Generally, 45 85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for. 

differences are limited so that the sequences of the reference 100%, and wherein any non-integer product of x„ and y is 

polypeptide and the variant are closely similar overall and, rounded down to the nearest integer prior to subtracting it 

in many regions, identical. A variant and reference polypep- from x„. Alterations of a polynucleotide sequence encoding 

tide may differ in amino acid sequence by one or more the polypeptide of SEQ ID NO:2 may create nonsense, 

substitutions, additions, deletions in any combination. A 50 missense or frameshift mutations in this coding sequence 

substituted or inserted amino acid residue may or may not be and thereby alter the polypeptide encoded by the polynucle- 

one encoded by the genetic code. A variant of a polynucle- otide following such alterations. 

otide or polypeptide may be a naturally occurring such as an Similarly, a polypeptide sequence of the present invention 
allelic variant, or it may be a variant that is not known to may be identical to the reference sequence of SEQ ID NO:2, 
occur naturally. Non-naturally occurring variants of poly- 55 that is be 100% identical, or it may include up to a certain 
nucleotides and polypeptides may be made by mutagenesis integer number of amino acid alterations as compared to the 
techniques or by direct synthesis. reference sequence such that the % identity is less than 
"Identity" is a measure of the identity of nucleotide 100%. Such alterations are selected from the group consist- 
sequences or amino acid sequences. In general, the ing of at least one amino acid deletion, substitution, includ- 
sequences are aligned so that the highest order match is 60 ing conservative and non-conservative substitution, or 
obtained. "Identity" per se has an art-recognized meaning insertion, and wherein said alterations may occur at the 
and can be calculated using published techniques (see, e.g.: amino- or carboxy-terminal positions of the reference 
COMPUTATIONAL MOLECULAR BIOLOGY, Lesk, A. polypeptide sequence or anywhere between those terminal 
M., ed., Oxford University Press, New York, 1988; BIO- positions, interspersed either individually among the amino 
COMPUTING: INFORMATICS AND GENOME 65 acids in the reference sequence or in one or more contiguous 
PROJECTS, Smith, D. W., ed., Academic Press, New York, groups within the reference sequence. The number of amino 
1993; COMPUTER ANALYSIS OF SEQUENCE DATA, acid alterations for a given % identity is determined by 
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multiplying the total number of amino acids in SEQ ID throughput format. The purified ligand for a receptor is 

NO: 2 by the numerical percent of the respective percent radiolabeled to high specific activity (50-2000 Ci/mmol) for 

identity (divided by 100) and then subtracting that product binding studies. A determination is then made that the 

from said total number of amino acids in SEQ ID NO: 2, or: process of radiolabeling does not diminish the activity of the 

h.Sx.-CvjO, 5 tig an< * towar< & its receptor. Assay conditions for buffers, 

m . c ■ n « :**u- ioos» pH and other modulators such as nucleotides are 

wherein n is the number of ammo acid alterations, x rt is the »■ • j * * u , , , - , 4 _ • t - r 

total number of amino acids in SEQ ID NO:2, and y is, for °P T**? ^ if 7 t T v 

instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., and both membra f *> d «« rece P t0 / purees For these 

wherein any non-integer product of x a and y is rounded assays, specific receptor binding is defined as total I associ- 

down to the nearest integer prior to subtracting it from x fl . " ated radioactivity minus the radioactivity measured m the 

"Fusion protein" refers to a protein encoded by two, often presence of an excess of unlabeled competing ligand. Where 

unrelated, fused genes or fragments thereof. In one example, possible, more than one competing ligand is used to define 

EP-A-0 464 discloses fusion proteins comprising various residual nonspecific binding. 

portions of constant region of immunoglobulin molecules 

together with another human protein or part thereof. In many 15 Example 4 
cases, employing an immunoglobulin Fc region as a part of 

a fusion protein is advantageous for use in therapy and Functional Assay in Xenopus Oocytes 

diagnosis resulting in, for example, improved pharmacoki- Capped RNA transcripts from linearized plasmid tem- 

netic properties [see, e.g., EP-A 0232 262]. On the other plates encoding th e receptor cDNAs of the invention are 

hand, for some uses it would be desirable to be able to delete 20 synthesized in vitro with RNA polymerases in accordance 

the Fc part after the fusion protein has been expressed, sta ndard procedures. In vitro transcripts are suspended 

detected and purified. in water at a final concentration of 0.2 mg/ml. Ovarian lobes 

All publications, including but not limited to patents and are removed from adult female toads, Stage V defolliculated 

patent applications, cited in this specification are herein oocytes are obtained, and RNA transcripts (10 ng/oocyte) 

incorporated by reference as if each individual publication 25 are injected in a 50 nl bolus using a microinjection appara- 

were specifically and individually indicated to be incorpo- tus. Two electrode voltage clamps are used to measure the 

rated by reference herein as though fully set forth. currents from individual Xenopus oocytes in response to 

agonist exposure. Recordings are made in Ca2+ free Barth's 

EXAMPLES medium at room temperature. The Xenopus system can be 

Example 1 30 use ^ to screen known ligands and tissue/cell extracts for 

^ „ „ . activating ligands. 
Mammalian Cell Expression 

Die receptors of the present invention are expressed in Example 5 
either human embryonic kidney 293 (HEK293) cells or 

adherent dhfr CHO cells. To maximize receptor expression, 35 Microphysiometric Assays 

typically all 5' and 3' untranslated regions (UTRs) are Activation 0 f a wide variety of secondary messenger 

removed from the receptor cDNA prior to insertion into a m ^ extrusion of small amounts of acid from 

pCDN or pCDNA3 vector. The cells are transfected with a ceU ^ add fofmed fe largdy as a resuh of me mcreased 

individual receptor cDNAs by lipofectin and selected in the metabolic activity required to fuel the intracellular signaling 

presence of 400 mg/ml G418. After 3 weeks of selection, ^ p roC ess. The pH changes in the media surrounding the cell 

individual clones are picked and expanded for further analy- are very sma ji t, ut are detectable by the CYTOSENSOR 

sis. HEK293 or CHO cells transfected with the vector alone microphysiometer (Molecular Devices Ltd., Menlo Park, 

serve as negative controls. To isolate cell lines stably Calif.). The CYTOSENSOR is thus capable of detecting the 

expressing the individual receptors, about 24 clones are activation of a receptor which is coupled to an energy 

typically selected and analyzed by Northern blot analysis. 45 utilizing intracellular signaling pathway such as the 

Receptor mRNAs are generally detectable in about 50% of G-protein coupled receptor of the present invention, 
the G418-resistant clones analyzed. 

Example 2 

Ligand bank for binding and functional assays 5Q Extract/Cell Supernatant Screening 

A bank of over 200 putative receptor ligands has been ^ large number of mammalian receptors exist for which 

assembled for screening. The bank comprises: transmitters, there remains, as yet, no cognate activating ligand (agonist), 

hormones and chemokines known to act via a human seven Thus, active ligands for these receptors may not be included 

transmembrane (7TM) receptor; naturally occurring com- within the ligands banks as identified to date. Accordingly, 

pounds which may be putative agonists for a human 7TM 55 the 7TM receptor of the invention is also functionally 

receptor, non-mammalian, biologically active peptides for screened (using calcium, cAMP, microphysiometer, oocyte 

which a mammalian counterpart has not yet been identified; electrophysiology, etc., functional screens) against tissue 

and compounds not found in nature, but which activate 7TM extracts to identify natural ligands. Extracts that produce 

receptors with unknown natural ligands. This bank is used to positive functional responses can be sequentially subfrac- 

initially screen the receptor for known ligands, using both 60 donated until an activating ligand is isolated and identified, 
functional (i.e . calcium, cAMP, microphysiometer, oocyte 

electrophysiology, etc, see below) as well as binding assays. xamp e 

Example 3 Calcium and cAMP Functional Assays 

Ligand Binding Assays 65 jjm receptors w hich are expressed in HEK 293 cells 

Ligand binding assays provide a direct method for ascer- have been shown to be coupled functionally to activation of 

taining receptor pharmacology and are adaptable to a high PLC and calcium mobilization and/or cAMP stimulation or 
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inhibition. Basal calcium levels in the HEK 293 cells in 
receptor-transfected or vector control cells were observed to 
be in the normal, 100 nM to 200 nM, range. HEK 293 cells 
expressing recombinant receptors are loaded with fura 2 and 
in a single day>150 selected ligands or tissue/cell extracts 
are evaluated for agonist induced calcium mobilization. 
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Similarly, HEK 293 cells expressing recombinant receptors 
are evaluated for the stimulation or inhibition of cAMP 
production using standard cAMP quantitation assays. Ago- 
nists presenting a calcium transient or cAMP flucuation are 
tested in vector control cells to determine if the response is 
unique to the transfected cells expressing receptor. 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES: 2 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AGAGCAAACC CCCTCCTGCT CAGAGCTGCT GCCGCCTGCG CCCAGGGCTG CACTCCGCGC * 60 

AGGCCTCATA GCCAGGCCAT GGCCCCCACA GAGCCCTGGA GCCCCAGCCC GGGGTCAGCG 120 

CCCTGGGACT ACTCGGGGTT GGACGGCCTG GAGGAGCTGG AGCTGTGTCC GGCCGGGGAC 180 

CTGCCCTACG GCTACGTCTA CATCCCCGCG CTCTACCTGG CGGCCTTCGC CGTGGGCCTG 240 

CTGGGCAACG CCTTTGTGGT GTGGCTGCTG GCCGGGCGGC GGGGCCCGCG GCGGCTGGTG 300 

GATACCTTCG TGCTGCACCT GGCGGCAGCT GACCTGGGCT TCGTGCTCAC GCTGCCGCTG 360 

TGGGCCGCGG CGGCGGCGCT AGGCGGCCGC TGGCCGTTCG GCGATGGCCT CTGCAAGCTC 420 

AGCAGCTTCG CGCTGGCGGG CACGCGCTGC GCGGGCGCGC TGCTGCTGGC GGGCATGAGC 480 

GTGGACCGCT ACCTGGCCGT GGTGAAGCTG CTCGAGGCGA GGCCACTGCG CACCCCGCGC 540 

TGCGCGCTGG CCTCGTGCTG CGGCGTCTGG GCCGTGGCGC TGCTGGCCGG CCTGCCCTCC 600 

CTGGTCTACC GGGGGTTGCA GCCCCTGCCT GGGGGCCAGG ACAGCCAGTG CGGCGAGGAG 660 

CCCTCCCACG CCTTCCAGGG CCTCAGCTTG CTGCTGCTGC TGCTGACCTT CGTGCTGCCC 720 

CTGGTCGTCA CCCTCTTCTG CTACTGCCGC ATCTCGCGCC GCCTGCGACG GCCGCCGCAC 780 

GTGGGTCGGG CCCGGAGGAA CTCGCTGCGC ATCATCTTCG C CATC GAG AG CACGTTTGTG 840 

GGCTCCTGGC TGCCCTTCAG CGCCCTGCGG GCCGTCTTCC ACCTGGCGCG TCTGGGGGCG 900 

CTGCCGCTGC CGTGCCCCCT GCTGCTGGCG CTGCGCTGGG GCCTCACCAT TGCCACCTGC 960 

CTGGCCTTCG TCAACAGCTG CGCCAACCCG CTCATCTACC TCCTGCTGGA CCGCTCATTC 1020 

CGAGCCCGGG CGCTGGACGG GGCCTGCGGG CGCACCGGCC GCCTGGCGCG AAGGATCAGC 1080 

TCAGCCTCCT CGCTCTCCAG GGACGACAGT TCCGTGTTCC GTTGCCGGGC CCAGGCCGCG 1140 

AACACTGCCT CGGCCTCCTG GTAGAAGCTT CGGG 1174 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 



(ii) MOLECULE TYPE: protein 
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-continued 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2i 

Met Ala Pro Thr Glu Pro Trp Ser Pro Ser Pro Gly Ser Ala Pro Trp 

1 5 • ,10 15 

Asp Tyr Ser Gly Leu Aep Gly Leu Glu Glu Leu Glu Leu Cys Pro Ala 
20 25 30 

Gly Aep Leu Pro Tyr Gly Tyr Val Tyr He Pro Ala Leu Tyr Leu Ala 
35 40 45 

Ala Phe Ala Val Gly Leu Leu Gly Asn Ala Phe Val Val Trp Leu Leu 
50 55 60 

Ala Gly Arg Arg Gly Pro Arg Arg Leu Val Asp Thr Phe Val Leu His 
65 70 75 80 

Leu Ala Ala Ala Asp Leu Gly Phe Val Leu Thr Leu Pro Leu Trp Ala 
85 90 95 

Ala Ala Ala Ala Leu Gly Gly Arg Trp Pro Phe Gly Asp Gly Leu Cys 
100 105 110 

Lys Leu Ser Ser Phe Ala Leu Ala Gly Thr Arg Cys Ala Gly Ala Leu 
115 120 125 

Leu Leu Ala Gly Met Ser Val Aep Arg Tyr Leu Ala Val Val Lys Leu 
130 135 140 

Leu Glu Ala Arg Pro Leu Arg Thr Pro Arg Cys Ala Leu Ala Ser Cys 
145 150 155 160 

Cys Gly Val Trp Ala Val Ala Leu Leu Ala Gly Leu Pro Ser Leu Val 
165 170 175 

Tyr Arg Gly Leu Gin Pro Leu Pro Gly Gly Gin Asp Ser Gin Cys Gly 
1B0 1B5 190 

Glu Glu Pro Ser His Ala Phe Gin Gly Leu Ser Leu Leu Leu Leu Leu 
195 200 205 

Leu Thr Phe Val Leu Pro Leu Val Val Thr Leu Phe Cys Tyr Cys Arg 
210 215 220 

He Ser Arg Arg Leu Arg Arg Pro Pro His Val Gly Arg Ala Arg Arg 
225 230 235 240 

Asn Ser Leu Arg He He Phe Ala He Glu Ser Thr Phe Val Gly Ser 
245 250 255 

Trp Leu Pro Phe Ser Ala Leu Arg Ala Val Phe His Leu Ala Arg Leu 
260 265 270 

Gly Ala Leu Pro Leu Pro Cys Pro Leu Leu Leu Ala Leu Arg Trp Gly 
275 280 285 

Leu Thr He Ala Thr Cys Leu Ala Phe Val Asn Ser Cys Ala Asn Pro 
290 295 300 

Leu He Tyr Leu Leu Leu Asp Arg Ser Phe Arg Ala Arg Ala Leu Asp 
305 310 315 320 

Gly Ala Cys Gly Arg Thr Gly Arg Leu Ala Arg Arg He Ser Ser Ala 
325 330 335 

Ser Ser Leu Ser Arg Asp Asp Ser Ser Val Phe Arg Cys Arg Ala Gin 
340 345 350 

Ala Ala Asn Thr Ala Ser Ala Ser Trp 
3S5 360 



What is claimed is: 3. The isolated polynucleotide of claim 1 comprising 

1. An isolated polynucleotide comprising a nucleotide nucleotides 79 to 1161 of the polynucleotide as set forth in 
sequence encoding the polypeptide comprising the amino SEQ ID NO:l. 

acid sequence as set forth in SEQ ID NO:2. 65 4. An isolated expression vector comprising a polynucle- 

2. The isolated polynucleotide of claim 1 comprising the otide encoding a polypeptide having the amino acid 
polynucleotide as set forth in SEQ ID NO:l. sequence as set forth in SEQ ID NO:2. 



« 
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5. A process for producing a recombinant host cell com- 
prising transforming or transfecting host a cell with the 
expression vector of claim 4 such that the host cell, under 
appropriate culture conditions, produces a polypeptide com- 
prising the amino acid sequence as set forth in SEQ ID 
N0:2. 

- 6. A recombinant host cell produced by the process of 
claim 5. 

7. An isolated membrane of the recombinant host cell of 
claim 6 expressing a polypeptide comprising the amino acid 
sequence as set forth in SEQ ID NO:2. 

8. A process for producing a polypeptide having the amino 
acid sequence as set forth in SEQ ID N0:2 comprising 



£,052 

24 

culturing the host cell of claim 6 under conditions sufficient 
for the production of said polypeptide and recovering said 
polypeptide from the culture. 

9. An isolated polynucleotide which is fully complemen- 
s tary to the nucleotide sequence encoding the amino acid 

sequence as set forth in SEQ ID NO:2. 

10. An isolated polynucleotide which is fully complemen- 
tary to the polynucleotide as set forth in SEQ ID NO:L 

11. An isolated polynucleotide which is fully complemen- 
10 tary to nucleotides 79 to 1161 of the polynucleotide as set 

forth in SEQ ID NO:l. 

* * * * * 



